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ABSTRACT 


Fluxes emitted at different wavebands from active galactic nuclei (AGNs) fluctuate at both long and short 
timescales. The variation can typically be characterized by a broadband power spectrum, which exhibits a red- 
noise process at high frequencies. The standard method of estimating power spectral density (PSD) of AGN 
variability is easily affected by systematic biases such as red-noise leakage and aliasing, in particular, when the 
observation spans a relatively short period and is gapped. Focusing on the high-frequency PSD that is strongly 
distorted due to red-noise leakage and usually not significantly affected by aliasing, we develop a novel and 
observable normalized leakage spectrum (NLS), which describes sensitively the effects of leaked red-noise 
power on the PSD at different temporal frequencies. Using Monte Carlo simulations, we demonstrate how an 
AGN underlying PSD sensitively determines the NLS when there is severe red-noise leakage and thereby how 
the NLS can be used to effectively constrain the underlying PSD. 


Subject headings: galaxies: active — methods: numerical — methods: data analysis — methods: statistical 


1. INTRODUCTION 


Active galactic nuclei (AGNs) are powered by accretion 
of matter onto supermassive black holes at the centers of 
galaxies. Their fluxes vary intrinsically at a wide range of 
timescales across a broad range of wavebands, among which 
X-ray continuum emission from a postulated corona shows 
the most rapid aperiodic variability. Optical emission, on the 
other hand, varies significantly at longer timescales. Power 
spectral density (PSD) is routinely used to study the distribu- 
tion of variability power P(f f] across temporal frequency f. 
The X-ray PSD of AGNs is often modeled by a broken 
power law (see the sketch in Fig. II, which usually starts 
with P(f) x 1/f% (8 ~ 1) at low frequencies, and steepens 
to P(f) x 1/f* (a > 2), being a type of red noise, above 
eae g Unley tal 2002 
2004). The optical PSD seems to be 
steeper than the X-ray one at the corresponding high frequen- 
cies, with a power-law-like shape of index around or larger 
than 3 (e.g. [Mushotzky et al [201 1). It may also flatten to an- 
other power law with index around 2 below a break frequency 
(Edelson et al||2014), and has to eventually flatten further at 
another low-frequency break to make power converge. 

The break frequency of the X-ray PSD corresponds 
to a characteristic timescale, which scales linearly with 


mass of black hole and possibly inversely with accre- 
tion rate (McHardy et al (2004 |2006; [Kelly et al) |2011; 
Gonzélez-Martin & Vaugha . This scaling relation can 


even extend to stellar-mass black hole accreting systems. The 
amplitude of high-frequency PSD (McHardy||2013} and ref- 
erence therein) or the integral of that part (e.g., Zhou et al. 
also scales inversely with black hole 
mass, which makes them potential black hole mass estima- 
tors. Recently, a method of measuring cosmological dis- 
tance using amplitudes of X-ray variability of AGNs at short 
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PSD with P to discriminate it from periodogram that is denoted as P in the 
subsequent text. 


timescales was discussed (La Franca et al.||2014). In addi- 
tion, the minimum timescale of variation is often used to con- 
strain the size of the emitting region based on light cross- 


ing time arguments in the studies of X-ray (eg, 2004; 
2005) and 7-ray (e.g.,|Abdo et al.||(2011) emission 


from blazars. The minimum variation timescale should cor- 
respond to the end of the high-frequency power law of the 
PSD, if such most rapid flares have the same physical origin 
as flares of longer durations. Multi-energy band timing stud- 
ies usually reveal more physical information than monochro- 
matic timing studies. However, verification of multi-energy 
band correlations and measurement of time lags between var- 
ious bands still demand determination of PSD of each band 
[Uttley et al.[2014. 

PSD, defined as the Fourier counterpart of the au- 
tocovariance function (ACF) of a continuous process 
(Timmer & Koenig) |1995), is standardly estimated by the 
squared-modulus of discrete Fourier transform of an ob- 


served lightcurve, i.e., a periodogram. Because a periodogram 
is not a consistent estimator of PSD (Vaughan et al] (2003) 
binning over frequencies and/or segments is usually neces- 
sary Papadakis & Lawreneel1999) But a 
binned periodogram still suffers from distortions such as red- 
noise leakagd'and aliasing caused by power leaking from be- 
yond the frequency range covered by the observation. Red- 
noise leakage can conceal the underlying PSD, when the con- 
tinuous lightcurve is shorter than the characteristic timescale, 
which is usually the case of real AGN observations. Such 
observations thus only cover the high-frequency part of PSD, 
inevitably being affected by red-noise leakage. 

Although several techniques have been used to allevi- 


ate this systematic distortion, the issue of red-noise leak- 
age is far from being solved perfectly. One such technique 


3 For a red-noise process (e.g., P(f) « 1/f% with a > 2), the variance 
Oe, integration of P(f)) is dominated by longterm variations. The large 
amount of variability power being distributed at low frequencies makes a 
short lightcurve display some longterm variation trend. This longterm vari- 
ability power is transferred from low frequencies to the high-frequency PSD 
(thus termed vividly as red-noise leakage), thereby distorting the intrinsic 
high-frequency PSD shape. The severity of red-noise leakage depends both 


on the shape of the underlying PSD and the length of the observation (See 
and reference therein). 
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FIG. 1.— The broken power law that is used to model the PSD of AGN 
X-ray variability and adopted in our subsequent lightcurve simulations. a, 3, 
T, are the high-frequency PSD slope, low-frequency PSD slope, and charac- 
teristic timescale, respectively; 8 is fixed to 1.0 throughout the simulations. 
At 1/Ty, the power of intrinsic variability equals that of the observed white 
noise. AT is the time interval between consecutive samplings, and 1/(2AT) 
is the highest frequency of the PSD, above which all power is smeared out. 
Note that since there is supposed to be a very long timescale, above which the 
variability of AGNSs is no longer correlated, we plot this intrinsic white noise 
together with part of the low-frequency power law using dotted lines. Note 
that such a low-frequency break is physically required to make the variabil- 
ity power converge at the low-frequency end unless the low-frequency slope 


B < 1.0; observationally, there is indeed one AGN whose PSD manifests this 
break (McHardy et al12007). 


is called “end matching”, which was proposed by [Fougerd 
(1985) and has still been used in many recent works (eg, 
Gonzalez Marti & Vaughan DOLD. 
It removes the longterm linear trend from the lightcurve by 
making the first and last data points level, with a purpose 
to suppress the low-frequency power leaked into the high- 
frequency PSD. Although somewhat arbitrary, this operation 
represents the simplest attempt to estimate the underlying 
PSD with limited observational data. 

(see also proposed the 
use of a damped random walk (DRW) model to describe the 
optical variability of AGNs. More recently,|Kelly et al] 
developed the continuous-time autoregressive moving aver- 
age (CARMA) model to constrain the PSD of a lightcurve. 
The advantage of the above two models is that they fit the 
lightcurve directly, thus being free from sparse sampling. 
However, some recent works (eg. 
based on Kepler observations indicated 
that the optical high-frequency PSD of AGNs is rather steep, 
deviating from a simple DRW process. The CARMA model, 
on the other hand, can flexibly fit many PSDs. But the model 
parameters do not have a specific physical meaning in many 
applications, and CARMA will always introduce additional 
artificial Lorentzian features in the PSD, thus being unable 


to generate the exact PSD shape as shown in Fig. [1] (e.g., 
[Edelson et al2014) 

Another technique is called “tapering”, i.e., using a ta- 
pered window to make the lightcurve go to zero near its end 
points more gradually than a sudden drop to zero 
O88), discussed the use of Han- 
ning sampling window function to reduce the effects of red- 
noise leakage of unevenly-sampled time series. Their meth- 


ods included Monte Carlo simulations 
and linear fitting of windowed periodograms. How- 
ever, they only considered PSD in the form of a simple power 
law, and thus did not discuss further the dependence of PSD 
on many additional parameters such as lightcurve length and 
break frequency. 

To resolve the issue of red-noise leakage and uncover 
the underlying high-frequency AGN PSD without altering 
lightcurve data and without relying on any specific physical 
models, we propose, in this paper, a novel and observable nor- 
malized leakage spectrum (NLS) to describe sensitively the 
effects of red-noise leakage on PSD and thereby constrain the 
underlying AGN PSD. Note that, in this paper, we only con- 
sider a stationary process, i.e., the underlying PSD remains 
unchanged during observations. The structure of this paper is 
as follows. We first present the standard method to estimate 
PSD, the method of time-series simulation, and the effects 
that distort PSD in Section[2] We then introduce the so-called 
NLS and examine how it is determined by the underlying PSD 
in SectionB] Subsequently, we describe our approach of con- 
straining PSD utilizing NLS in Section|4] Finally, we discuss 
further NLS and its applications in Section[5] and summarize 
our results and draw our conclusions in Section[6] 


2. STANDARD METHOD, SIMULATION, AND BIASES 


Continuous observations of particular targets are usually 
short, lasting minutes to hours. Optical observations made 
with ground-based telescopes are limited to the length of a 
night, or the visibility of the target on the sky. Space tele- 
scopes may have their optical/X-ray observations interrupted 
due to periodic blocking of the telescope’s view of the target 
by the Earth, gaps in the telemetry (Vaughanl|2013), or de- 
tector shutoff during the telescope’s passage over the South 
Atlantic Anomaly area. As a result, two types of observa- 
tional data have to be treated differently. A single short con- 
tinuous observation corresponds to the variability observed at 
short timescales, i.e., the high-frequency PSD; while long- 
duration consecutive monitoring with gaps corresponds to the 
variability observed at long timescales, i.e., the low-frequency 
PSD. In this paper we concentrate on constraining the under- 
lying high-frequency PSD. We generate artificial lightcurves 
using simulations with specified input PSD parameters. The 
lightcurves are then sampled according to patterns similar to 
real observations. 


2.1. Calculating periodogram 


Periodogram is calculated as modulus-squared of the Dis- 
crete Fourier Transform of a lightcurve with an appropriate 
normalization. Suppose a lightcurve x; is T long and evenly 


sampled with interval AT at discrete times t; (i = 1,2,...,7), 
then the periodogram P(f) normalized with A aps is: 
PCF) Aal DT ED] (1) 


2 
n 
IDET? = (3 xe? 
i=1 


n 2 n 2 
= { ZC xcosenpi} + { ZC asineansiey} 
i=l i=l 
(2) 


where f; = j/nAT with j= 1,2,...,n/2. [1/nAT,1/2AT] 
is the temporal frequency range covered by the observation, 


where 1/2AT is so-called Nyquist frequency. We use abso- 
lute normalization Au =2AT /n so that 
integrating the periodogram yields rms? (see Section]for the 
discussion on different normalization choices). 


2.2. Simulating artificial lightcurves 


We use the algorithm of (1995) to gen- 
erate artificial lightcurves. As plotted in Fig. [I](i.e., the solid 
line), a broken power law, 


Af *+A(U/Ty)*, f>fe 
P(A)= = 3 
S we (ik), rent © 


is used as the model PSD. fe is the break frequency that corre- 
sponds to the characteristic timescale TL. by foe = 1/T.. While 
the low-frequency logarithmic slope £ is fixed to 1.0 through- 
out the paper, the high-frequency logarithmic slope o varies 
around 2.0 in the simulations. We add a constant component 
to the model to represent Poisson noise. The timescale where 
the power law component equals the white noise component 
is denoted as Tw. A is an arbitrary constant so that our simu- 
lated lightcurves are in arbitrary units. 

We then multiply a complex number (x + iy)/ V2 to the 
model PSD at every frequency, where x and y are drawn from 
the standard normal distribution. The zero frequency com- 
ponent is set to zero, and the components along negative fre- 
quency axis are set to the conjugate of corresponding pos- 
itive frequencies (see also Appendix B of 
for another equivalent way to obtain a randomized PSD). A 
subsequent inverse Fourier Transform results in an artificial 
lightcurve as a realization of the underlying PSD. We note that 
there is a new algorithm of time series simulation developed 
recently Geesse (2013), which is further 
discussed in Section|5] 


2.3. Red-noise leakage and aliasing 


Astronomical observations are time-limited and can only 
record a tiny fraction of the effectively infinitely long flux of 
AGNs, which means to multiply the continuous lightcurve by 
a rectangular function. Such an operation is equivalent to con- 
voluting the PSD with the Fourier Transform of the sampling 
rectangular function (van der Klisl1988), which is a sinc func- 
tion with strong sidelobes (see Fig. 3 of Max -Moerbeck et al] 
KuEI Convolution transfers power from one band to its 
vicinity, resulting in severe red-noise leakage when there is 
a significant amount of power below the observed frequency 
range in, e.g., the case of a red-noise process. 

Aliasing, on the other hand, happens when the observation 
is gapped, which means the detector is not recording the pho- 
tons from the source for some reason, and the power above the 
Nyquist frequency folds back. In real observations detectors 
integrate the flux over some smallest time interval AT, which 
essentially smears out the high-frequency variability and pre- 
vents the power from folding back. Moreover, the intrinsic 
AGN variability drops quickly at high frequencies, where the 
white noise of observation errors always dominates. There- 
fore, aliasing can flatten the low-frequency PSD to some de- 
gree, but is usually out of concern in the high-frequency PSD. 

We show these two systematic biases of periodograms in 
Fig. |2| In the left panel, we generate an artificial lightcurve 
(black, original) 4 x 10°s long with a realistic PSD model of 
a=2.4, 6=1.0, T,= 10°s, and Ty = 50s (annotated in the right 
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panel) that is motivated by real AGN observations, which rep- 
resents a fully sampled ideal lightcurve (AT = 1s). Then 
we evenly pick one out of every 200 samplings of the ideal 
lightcurve to obtain a discontinuously sampled lightcurve 
(red, 1/200 cadence), which is 200 times undersampled. We 
also divide the original lightcurve into 200 segments, and 
choose a random one as a time-limited lightcurve (blue, 1/200 
duration) so that it remains the original sampling cadence 
but is 200 times shorter. The periodograms of these three 
lightcurves are calculated and plotted in the right panel with 
the colors of periodograms corresponding to the colors of 
lightcurves in the left panel. 

As shown in the right panel of Fig. the frequency 
ranges covered by the red and blue periodograms are nar- 
rower than the black periodogram at the high-frequency and 
low-frequency ends, respectively, which is expected. It is ev- 
ident that the black periodogram well represents the under- 
lying (model) PSD, but the red and blue ones are clearly af- 
fected by aliasing and red-noise leakage, respectively. Alias- 
ing flattens the PSD of the discontinuously sampled lightcurve 
at the high-frequency part, i.e., the red periodogram is located 
slightly above the black periodogram around IO "Hz (i.e., be- 
ing flatter), but the low-frequency part (<, 10~*Hz) is unaf- 
fected as the red and black periodograms overlap perfectly. 
The blue periodogram has an overall higher normalization 
and a significantly tighter dispersion compared with the black 
periodogram, which indicates that, due to red-noise leakage, 
the periodogram of a short continuous observation does not 
well represent the high-frequency model PSD of a red-noise 
process. At the highest frequencies (> 10~'Hz), white noise 
dominates for both the blue and black periodograms. 


3. NORMALIZED LEAKAGE SPECTRUM 
3.1. Bias Distribution 


To further demonstrate the ensemble effect of red-noise 
leakage, in panel (a) of Fig.|3) we calculate in linear coordi- 
nates the mean (shown as blue curve) and standard deviation 
(shown as red curve) of the periodograms of all the 200 seg- 
ments of the original lightcurve, and plot them together with 
the high-frequency part of the model PSD (shown as black 
curve). According to Fig. Bh, the mean spectrum becomes 
flatter than the model PSD that is assumed to be in the form 
of 1/f?* for this demonstration, due to the significant con- 
tribution of the leaked power that distributes in the form of 
1/f? (see|Gonzalez-Martin & Vaughan|2012 
and reference therein). When there is no red-noise leakage, 
the priodogram follows a x?(dof = 2) distribution (Oe. Chi- 
squared distribution with two degrees of freedom) around 
the continuous PSD function at every frequency 
[1988): suppose that the underlying PSD is a power law with 
a slope q, then the periodogram of a lightcurve is 


PRAE, (4) 
where q; follows x?(dof = 2) whose standard deviation is 
equal to the expectation Oe, 100% dispersion). However, 
when there is red-noise leakage, the periodogram approxi- 
mately becomes 

PUD) =AGOLARG, (5) 
where R represents the amplitude of the leaked power that is 
random (thus different) from a lightcurve to another. In other 
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FIG. 2.— Left: A simulated lightcurve (in an arbitrary unit) that is 4 x 10°s long and plotted in black, with the parameters of the corresponding PSD model 
listed in the right panel. This original lightcurve is resampled with a 1/200 cadence by taking one data point out of every 200 and is plotted in red. Note that 
the red lightcurve is exactly on top of the black lightcurve, so we provide a zoomed-in inset for better visualization of such a 1/200 resampling pattern. We also 
choose one random segment 2 x 10*s long from the black lightcurve (i.e, 1/200 in length of the original black one) to represent a time-limited observation and 
plot it in blue (highlighted by a downward blue arrow). Right: The periodograms of the three lightcurves are plotted in the same colors as the lightcurves. It is 
clear that the red and blue periodograms suffer from aliasing and red-noise leakage, respectively. 


words, red-noise leakage makes the distribution of power 
spectrum around the mean deviate from an assumed 7(dof = 
2) distribution, and the resulting PSD distortion is systematic 
across all frequency bins; such a different behavior of the red- 
noise distorted periodogram is indicated by the fact that the 
standard deviation curve lies slightly (yet clearly) above the 
mean curve (see the inset of Fig. Bh). The above deviation is 


caused by the fact that the leaked power is not equally dis- 
tributed over different segments (Uttley et al.||2002). An ad- 
ditional relevant issue, as posed in (1997), is the 


shift of the bend between the power law and white noise com- 
ponents, i.e., 1/Ty. If Ty is treated as the minimum intrinsic 
variability timescale above the level of the observational fluc- 
tuation background, it tends to be mis-estimated from a red- 
noise distorted periodogram and the resulting shift could be 
as large as one order of magnitude. 

The gold shaded area in Fig. Bh, enclosed by the standard 
deviation curve and the model PSD curve, roughly shows the 
“bias” introduced by red-noise leakage. We calculate the size 
of this shaded area in the logarithmic coordinates at different 
frequencies and plot the derived bias curve in the same color 
in panel (c). The bias curve Oe, the gold curve in Fig. BẸ) 
rises first because of the different slopes for the model PSD 
and the standard deviation curve, then peaks around 1 /Tw, and 
finally falls to zero. Apparently, a zero-bias curve corresponds 
to the ideal case without any red-noise leakage. As a compar- 
ison, we repeat the above procedure and carry out another set 
of simulations by changing only the value of a to 2.0, i.e., 
keeping all the other parameters of the model PSD the same 
as those adopted in Figs. Bh and We plot the correspond- 
ing results in Figs.Bb and[BU. It appears that (1) the standard 
deviation curve and the mean curve overlap largely although 
the former being somewhat higher than the latter (see the inset 
of Pe Bit. and (2) the bias curve is basically flat and falls to 
zero without any peak. These results are expected given that 
the model and biased mean periodograms have the same slope 
(1.e., a = 2.0). 

If we want to uncover the underlying PSD from the red- 
noise distorted periodogram, a seemingly obvious way would 
be to subtract the bias from the observed distorted peri- 


odogram. Such an approach will, unfortunately, not work in 
practice because we cannot determine the bias without know- 
ing the underlying PSD. However, what we do know is that a 
power spectrum unaffected by red-noise leakage should fol- 
low a y?(dof = 2) distribution, whose mean is identical to 
its standard deviation; therefore, the model PSD in this case 
stands for not only the expectation of a periodogram that is a 
realization of the model PSD, but also the standard deviation 
curve. Due to red-noise leakage, the power of a periodogram 
calculated at different frequencies are no longer independent 
(Done et al][1992), which results in the much tighter disper- 
sion of periodogram as shown in the blue spectrum in the right 
panel of Pie DI in stark contrast to the black spectrum. On the 
other hand, the standard deviation curve lies above the mean 
curve, which is shown in Figs. and and indicates an 
additional dispersion between different segments due to red- 
noise leakage. The above arguments reveal that, as far as the 
high-frequency part of the PSD is concerned, even though we 
cannot distinguish directly between the leaked power of the 
long-timescale variability and the intrinsic power of the short- 
timescale variability, we can utilize their different behaviors 
of periodogram dispersion to trace the bias, thus uncovering 
the underlying high-frequency PSD. 


3.2. Calculating normalized leakage spectrum 


Suppose there are M evenly-sampled lightcurves of equal 
length, and these lightcurves are either non-overlapping seg- 
ments of a single continuous observation or sporadic ob- 
servations. We then calculate the corresponding M peri- 
odograms and bin over frequencies logarithmically accord- 
ing to the method of [Papadakis & Lavvenca (1993) with a 
bin size of N, i.e., for each periodogram, we group every N 
consecutive frequencies, take the mean of powers logarith- 
mically as the power at the logarithmic mean frequency of 
those N frequencies, and denote the result as log[P;( where 
i=1,2,...,M. Note i ie a (1993) re- 
quired N > 20 so that the distribution of the error of the binned 
periodogram is more Gaussian in order to fit the binned peri- 


odogram correctly, which is not required in our case because 
we do not intend to fit the binned power spectrum and actu- 
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FIG. 3.— (a) The mean and standard deviation of the 200 periodograms (plotted as the blue and red curves, respectively) and the high-frequency model PSD 
(plotted as the black curve). The gold shaded area between the red and black curves shows the bias caused by red-noise leakage. The zoomed-in inset presents 
more clearly the mean and standard deviation curves. (b) Same as panel (a), but with the parameter of a = 2. (c) The size of the gold shaded area shown in panel 
(a) calculated at different frequencies in logarithmic coordinates (plotted as the gold curve), i.e., the bias curve, with the y = 0 dashed-dot line corresponding to 
the zero-bias curve (i.e., the ideal case without any red-noise leakage). The green curve is the corresponding NLS of the 200 periodograms (see Section|3.2|for 
details), with the y = 1 dashed-dot line corresponding to the ideal NLS without any red-noise leakage. (d) Same as (c), but corresponding to the case of panel (b). 


ally the (dof = 2) distribution assumption does not hold in 
the biased periodogram of real data. With the M binned pe- 
riodograms, we calculate the standard deviation at every fre- 
quency and denote it as std{log[P(f)]}. Subsequently, we de- 
fine a novel observable quantity, the normalized leakage spec- 
trum CO, i.e., NLS, as 


_ std{log[ PA] 
EO gapo MN 
i 
SITE Sg (7) 


std{log[P]} is solely determined by N and is an accurate esti- 
mation of the standard deviation of the binned periodogram 
if the red-noise leakage is totally absent, for the case of a 
single power-law distributed PSD that is valid for the high- 
frequency PSD above the break frequency as shown in Fig. [1] 
(cf. Eq. 20 of/Papadakis & Lawrence) (1993) for std{log[P]}). 
NLS is dimensionless and describes the severity of red-noise 
leakage across different frequencies. There is an obvious fact 
that L(f) > 1, with the following indications: when CO) ~ 1, 


the observation-derived std{log[P( ET (with fluctuations) is 
consistent with the theoretical estimation std{log[P] }, reflect- 
ing that the red-noise leakage at the frequency f is not severe; 
and £(f) > 1 is expected when the red-noise leakage is severe 
at the frequency f. 

We calculate the NLS of the artificial lightcurves generated 
previously in Figs. Bh and Bp, and plot them in green color 
in Figs. Bt and BU. In this case, we have M = 200 that corre- 
sponds to the 200 segments of the original artificial lightcurve 
(see Section|3.1), and we adopt a bin size of N = 5, resulting 
in that the NLS covers a narrower frequency range than the 
corresponding bias curve. It is clear that the NLS follows the 
trend of the bias curve tightly irregardless of the value of N 
(see Section |3.3), indicating that the NLS does describe well 
the effects of red-noise leakage. We note that the shape of the 
NLS depends on N (see Section[3.3) and is therefore not nec- 
essarily exactly the same as that of the bias curve, although 
it happens to seem that the NLS in Figs. Bb andBHU, if shifted 
vertically by ~ —1.0, will overlap with the corresponding bias 
curve. 
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FIG. 4.— Dependence of NLS on the bin size of binned periodogram. The 
horizontal solid line corresponds to the ideal case without any red-noise leak- 
age. 


3.3. Dependence on N 


The systematic bias caused by red-noise leakage depends 
on the underlying PSD and the length of the observed 
lightcurve. As demonstrated above, the NLS well describes 
the red-noise leakage of the observations given some specific 
sampling pattern. According to Eq.[6Jand Eq.[7] the NLS also 
depends on N, i.e., the bin size of binned periodogram. Us- 
ing the 200 segments of the original lightcurve simulated in 
Fig. [2] we calculate again the NLS with different N values 
and plot the results in Fig.[4] The overall shapes of NLS with 
different N are not the same. The estimation of std{log[P]}, 
which is the denominator of the NLS, decreases with increas- 
ing N. The larger N is, the more the NLS deviates from the 
black y = 1 horizontal solid line, making red-noise leakage 
seem more severe, even though its severity is actually not 
determined by N. The increase of N not only amplifies the 
NLS, but also suppresses the frequency range covered by the 
NLS, e.g., the case of N = 50 corresponds to the narrowest 
frequency range of the NLS in Fig.|4} A point worth noting 
is that the rising, peaking, and declining trend is identical for 
the NLS with different N values, indicating that, irregardless 
of the N values, the NLS is a reliable indicator of red-noise 
leakage. 


3.4. Dependence on PSD 


We use artificial lightcurves from simulations to exam- 
ine how the NLS depends on PSD. For each simulation, we 
change only one parameter out of a, Tọ, and Tw while keeping 
the remaining PSD parameters the same as those adopted in 
the right panel of Fig. 2] (ie., a= 2.4, T= 10°s, Ty= 50s, and 
8 = 1.0). The additional default parameters are T = 4 x 10°s, 
AT = 1s, M = 200, and N = 20. The results are shown in 
Fig. [5] 

In Fig.[5h, only o is changed for each NLS. The response of 
the change of NLS to the change of a is consistent with that 
presented in Figs. Bt (a = 2.4) and BH (a = 2.0), where the 
NLS follows closely the trend of the bias curve. The leakage 
is more severe for a larger a value. Note that red-noise leak- 
age always occurs as long as the lightcurve is not long enough 
to cover all the variation power and thus shows some longterm 
variation trend. Even for the case of a = 1.8, the NLS still dis- 


plays a sign of red-noise leakage, which is nevertheless much 
less severe than the cases of œ > 2. In general, the depen- 
dence of NLS on a is as follows: when a > 2, the NLS peaks 
around f = 1/Tw with larger a leading to higher NLS and 
steeper rising before the peak; when a = 2, the NLS is ba- 
sically flat below 1/T,, and falls to 1 without any peak; and 
when a < 2, the NLS declines monotonically until it flattens 
to 1. Itis also interesting to note that, below 1/T,,, when a > 2 
the higher-frequency part of periodogram (around 10~*?Hz) 
suffers more from red-noise leakage than the lower-frequency 
part of periodogram (around 10~°-°Hz), i.e., the NLS slope is 
positive; while the trend is reversed when o < 2 such that the 
lower-frequency part of periodogram suffers more from red- 
noise leakage than the higher-frequency part of periodogram, 
i.e., the NLS slope is negative. The above results demonstrate 
clearly that the NLS slope (below 1/T,,) has a sensitive and 
continuous dependence on the high-frequency slope of the un- 
derlying PSD (i.e., a; see Pa 

In Fig. Bb, we change the characteristic timescale T, and 
examine the dependence of NLS on it. Apparently, T, does 
not affect the slope of NLS below 1/T,y. Rather, T, affects 
the height of the peak that indicates the severity of red-noise 
leakage. Larger T, values indicate that there is more power 
below the observed frequency range, thus corresponding to 
larger peak heights (i.e., more severe red-noise leakage). 

In Fig. Bt, we show the dependence of NLS on Tw, which 
represents the minimum intrinsic variability timescale that is 
measurable. As the predominance of white noise starts above 
1/Tw, the NLS peak is always around 1/T, for different Ty 
values. Furthermore, because of the definition of Tw, the ex- 
act NLS peak shifts to the left of 1 /Tw by about 0.1 dex (see 
Figs.[[andB). Simply like 7., Tw does not affect the steepness 
of NLS below 1/T,. When the S/N of the data is low, i.e., 
the measurable power of intrinsic variability is weak com- 
pared with the dominant white noise in the high-frequency 
PSD (e.g., the case of Tw = 1600s), there appears no obvious 
NLS peak within our interested frequency range, even though 
a=2.4 > 2 in this case. 

We conclude from Fig. [5]that the NLS depends sensitively 
on the underlying PSD in the following way: a determines 
the steepness of NLS Oe, the NLS slope below 1/T,); Tw 
determines the location of the NLS peak; and both a and T, 
affect the height of peak that indicates how severe the red- 
noise leakage is. These result in the idea that we can actually 
use the NLS in conjunction with Monte Carlo simulations to 
uncover the underlying PSD of a source, which is covered 
only by some sporadic observations (i.e., the typical realistic 
observational situation) and whose PSD is therefore subject 
to significant red-noise leakage. 


4. CONSTRAINING THE UNDERLYING PSD 


In this section we demonstrate how to constrain the under- 
lying PSD utilizing artificial lightcurves made from simula- 
tions. In particular, we are interested in these three parame- 
ters, a, Te, and Ty. 


4.1. Mock observations 


To generate an artificial lightcurve, we adopt a = 2.3, Tẹ, = 
5 x 104s Oe, log(f,) = —4.70), and Tw = 80s as the input un- 
derlying high-frequency PSD, i.e., the one that we aim to un- 
cover eventually. We set the remaining parameters to the de- 
fault values Oe, 8 = 1.0 and AT = 1s). Additionally, to make 
our mock observations more realistic, we make the model 
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FIG. 5.— Dependence of NLS on the underlying PSD. The default PSD parameters are detailed in Section The black y = 1 horizontal lines correspond 
to the ideal cases without any red-noise leakage. (a) Dependence of NLS on the high-frequency slope a of the underlying PSD. (b) Dependence of NLS on the 


characteristic timescale Te. (c) Dependence of NLS on Ty. 


PSD break to another white-noise level below 2 x 10Hz 
De, log(2 x 10-8) = -7.70, being a factor of 10? lower than 
the above high-frequency break; see Fig. [I] for a schematic 
illustration), given that a physically meaningful PSD has to 
flatten in some way to make power converge 
[2003). Indeed, such a low-frequency break in the X-ray 
PSD of AGNs has been observed in, e.g., the narrow line 
Seyfert 1 galaxy Ark 564 (McHardy et all (2007) We note 
that this is the only simulation in this paper where we use 
a double broken power-law PSD instead of a single broken 
power-law PSD. From this simulation, we generate an artifi- 
cial lightcurve 1 x 108s long, and divide it into 5000 segments 
of equal duration, each lasting 2 x 104s. 

We randomly choose 20 out of the entire 5000 segments to 
mimic 20 sporadic real observations. We show the 1 x 108s 
lightcurve in the top panel of Fig.|6] where the red triangles in- 
dicate the locations of the 20 random mock observations. The 
NLS is calculated for these 20 short lightcurves (i.e., M = 20) 
and referred to as Mock NLS I. As Mock NLS I is sample 
volume limited, we also calculate another NLS for compari- 
son using the entire 5000 segments (i.e., M = 5000), in order 
to examine whether Mock NLS I still leads to any systemati- 
cally biased result. We refer to this later NLS as Mock NLS 
II, which represents the expectation of Mock NLS I. If we in- 
crease the number of segments included in calculating Mock 
NLS I, it will eventually converge to Mock NLS II. Mock 
NLS I and II are shown in the bottom panel of Fig. [6] Mock 
NLS II is much smoother than Mock NLS I because it is made 
using all the data available Oe. 5000 segments) and thus 
is subject to much less statistical fluctuations. The fact that 
Mock NLS I and II behave very similarly (i.e., they basically 
follow each other closely) indicates that Mock NLS I does 
not suffer from any appreciable systematic bias even though 
it makes use of only a very small portion (i.e., 20/5000=0.4%) 
of the data. 


4.2. Model NLS 


We perform a large amount of additional simulations to ob- 
tain model NLS that can be compared with the above Mock 
NLS I and II. For each model PSD with a different set of pa- 
rameters (i.e., a, To, and Tw), we generate a lightcurve 4 x 10’s 
long, being shorter than 1 x 108s adopted in the above simula- 
tion of mock observations, because the duration of a real AGN 
lightcurve, which is effectively infinitely long, should not be 
considered as known. If we generate the lightcurve with the 


same length of 1 x 108s, our results would only be better, at 
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FIG. 6.— Top: We simulate a lightcurve 1 x 108s long and divide it into 
5000 segments of equal duration, each lasting 2 x 104s. We then randomly 
choose 20 segments to mimic 20 sporadic observations, whose locations are 
indicated by the red triangles. Bottom: The NLS of the above 20 mock ob- 
servations, referred to as Mock NLS I, is plotted as the red curve. The thick 
grey long-dashed line indicates the best-fit linear model to the part of Mock 
NLS I below 1/Ty, with the best-fit slope annotated. Another NLS, referred 
to as Mock NLS IL, is calculated using the entire 5000 segments and plotted 
as the blue dashed curve. 


the expense of longer computational time. Another difference 
is that the smallest time interval AT is 5s here, not as fine 
as in the mock observations (AT = 1s), because white noise 
dominates the PSD at timescales smaller than Tẹ, the NLS 
is characterless at the highest frequencies (being constant at 
a value of 1) while the number of data points there is large. 
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FIG. 7.— Normalized histogram of Xs model /Ndata and probability den- 


sity functions of two reduced Chi-squared distributions with 9 degrees of 
freedom and 1 degree of freedom respectively. The normalized histogram 


of XLS. adei /Naata, Which is calculated using the Naata = 9 data points in 
our example (see Pe DU. does not follow the probability density function 
of x?,(dof = 9). Instead, due to the fact that the NLS at different frequencies 
is not independent, ës model/ Naata follows closely the probability density 


function of x2, (dof = 1). 


The adoption of a larger time resolution significantly reduces 
computational time. 

The lightcurve is then divided into 2000 segments of equal 
duration, so each segment is also 2 x 10s long. They are 
subsequently arranged into 100 groups, each of which con- 
tains 20 segments (i.e., M = 20), the same number as the mock 
observations. The above procedure eventually results in 100 
NLS Ge, each group of 20 segments resulting in a NLS), 
whose mean and standard deviation at every frequency are de- 
noted as Lmodei(f) and ALmoaei(f). Thus, for each given set 
of model PSD parameters, we can have a set of Linodei(f) and 
ALmodel(f), which we refer to as the model NLS and its stan- 
dard deviation, respectively. By comparing the model NLS 
with the mock NLS (i.e., Lmock(f)), we can examine whether 
the model PSD matches the underlying PSD of the mock ob- 
servations. 


4.3. Assessing the goodness of ‘fitting’ 


To determine how well the model NLS fits the mock NLS, 
we define a Ze XNLs,# analogous to xq, defined in 


(2002): 


fmax Po ua j 
5 SE 
Z Emoia AD | (8) 
XNLS, EN ( AL moaei(f) 


where the asterisk stands for mock or model, and fmin and fmax 
are the minimum and maximum frequencies of the binned pe- 
riodogram of a lightcurve 2 x 104s long. Lmock(f) is derived 
from the mock observations, and in our case it is either Mock 
NLS I or Mock NLS II, resulting in a corresponding eer Gah 
value. 

The errors on NLS, being largely unknown, will in no way 
follow a simple Gaussian distribution. However, using the 
100 NLS available, we can determine roughly the distribu- 
tion of is. model bY putting each of the 100 NLS into Eq. 


and obtaining 100 different cae mode] Values. We can then 
estimate, for each given set of model PSD parameters, the 


probability of POLS, model > ës mock)> 1-€., the total cases 
of ges, mode! being larger than ës mock divided by 100. The 


larger P(XÃLs. model > XÅLS. mock) İS, the better the correspond- 
ing model NLS fits the mock NLS, i.e., the better the cor- 
responding model PSD describes the underlying PSD of the 
mock observations. In this exercise, the implicit assumption is 
that the unknown distribution of xis, mock 1S the same as that 


of XALS, mode) Whose PSD parameters are the same as those 
of the underlying true PSD. We present the normalized dis- 
tribution of ët model/ Naata in Pie. DÉI which shows that, due 
to the fact that the NLS at different frequencies is not inde- 
pendent, the “effective” degree of freedom of XALS, model Te- 
mains | despite of Naata = 9 in this demonstration. Note that 
we do not use bootstrapping as in[Uttley et al] to deter- 
mine the distribution of vu s, because the NLS follows a non- 
Gaussian distribution, and the NLS at every frequency is not 
independent (just like the case of the red-noise distorted peri- 
odogram). As such, bootstrapping will broaden the spread of 
XNLS and result in large POXRs, model 7 XNLS, mock) values for 


many model PSDs. Actually, due to the same reason, AL model 
is larger than the actual spread of the NLS. 


4.4. Freeing one parameter 


For the three model PSD parameters of a, To, and Tw, we 
fix two of them to the assumed values that are used to gen- 
erate mock lightcurves in Section and search for the best 
value for the remaining “unknown” parameter. The results 
are shown in Fig. land Table [I] In real applications, these 
fixed parameters could be determined from low-frequency 
PSD modelings and/or other ways of constraints. We do not 
provide the errors on the parameters in the case of Mock NLS 
II since it comes from a much larger set of samples (M = 5000) 
and is therefore expected to have smaller corresponding errors 
on the parameters than those in the case of Mock NLS I. 


TABLE 1 
RESULTS OF CONSTRAINING THE UNDERLYING PSD WHEN ONLY ONE 
PARAMETER IS FREE 


Free Mock NLS I Mock NLS II Input Model 
Parameter (Figure Panel) (Figure Panel) Parameter 
a 2.3200 (8p) 2.28 (8b) 2.30 
a 2.34%): 59 (Oh) 2.28 (9b) 2.30 
log(fe)  —4.75*9-24 BW ` -4.74 (BH) -4.70 
Tw 767338 Bh 83s BP 80s 


4.4.1. Freeing a 


We choose 10 different o values that are evenly spaced be- 
tween 2.1 and 2.6 (i.e., Aa ~ 0.044), given that an AGN PSD 
has a typical value of a > 2. In practice, without performing 
any simulation, one can easily determine a > 2 or a < 2 by 
checking whether there exists a peak in the NLS derived from 
observations (see Section and Fig. 5h). We show the con- 
straints on o in Figs.[8b and[8b for the cases of Mock NLS I 
and II, respectively (also see Table[I}. It is clear that, in both 
cases, the derived a = 2,3210: and a = 2.28 are in excellent 
agreement with the input a = 2.30 of the model PSD. 

As shown in Figs. and [6] the NLS slope below 
1/T,, uniquely reflects the steepness of the underlying PSD 
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FIG. 8.— Constraining the underlying model PSD when only one parameter is unknown (i.e., being free). The results for the case of Mock NLS I are shown 
in panels (a-c), and those for the case of Mock NLS II are shown in panels (d—f). In each panel, the x-axis indicates the specific unknown parameter to be 
constrained; the y-axis p stands for Pis nii” XALs mock) With the blue p curve being smoothed by a cubic function; the horizontal dashed line corresponds 
to 1— p = 0.68 and its intersections with the p curve indicate the lo range of the unknown parameter; and the best value and its lo range determined for the 


unknown parameter are denoted atop. 
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FIG. 9.— Constraining a of the underlying model PSD with a different method for the cases of (a) Mock NLS I and (b) Mock NLS IL, respectively. The 
intersections of the vertical red lines with the three percentile curves indicate the best a value and its 1o confidence interval (see Section for details). 


De, œ), irregardless of variations of T, and Tw. So we de- 
velop another method to constrain a, which was inspired by 
(2014). From the 100 NLS derived with 
each a value, we find three percentiles, i.e., 16%, 50%, and 
84% of the NLS slopes, plot them in Fig. D] interpolate the 
same percentiles for different a values (i.e., the blue solid 
and dashed curves), and finally draw the NLS slope of Mock 
NLS I or II (e.g., in Fig. (9h, we draw the vertical red line of 
slope=0.53 for Mock NLS I; this slope value is also shown in 
the bottom panel of Fig.[6). As shown in Figs. Dh and Däin, the 
intersections of the vertical red lines with the three percentile 
curves result in estimates of a and its Lo confidence interval. 


The derived a = 2.341910 for Mock NLS I and a = 2.28 for 
Mock NLS II are again in excellent agreement with the input 
a = 2.30 of the model PSD (also see "Table. 

Using the above two methods, we constrain well the a val- 
ues for the cases of both Mock NLS I and Mock NLS II, with 
the results for Mock NLS II being better and not obviously 
biased. 


4.4.2. Freeing Te 


We choose 15 different log( f+) values that are evenly spaced 
between —5.5 to —4.0 (i.e., Alog(f.) ~ 0.11). We show the 
constraints on log( fz) in Figs.[8h and[8H for the cases of Mock 
NLS I and Mock NLS II, respectively (also see Table[I). It 
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Fic. 10.— Contour plots of constraining simultaneously a and Te of the 
underlying model PSD for the cases of Mock NLS I and II, when Ty is fixed to 
80s. The yellow, green, and cyan contours indicate 10, 20, and Ze confidence 
intervals of œ and log(f-), whose true values in the input model PSD are 
marked by black crosses. The red cross represents an example to be displayed 
in Fig.[11|for further discussions. 


is clear that, in both cases, the derived log(f,) = —4.751) 33 
and log( f.) = —4.74 are in excellent agreement with the input 
log(fc) = —4.70 of the model PSD. 


4.4.3. Freeing Ty 


We choose 18 different Tw values that are evenly spaced 
between 20s and 140s (i.e., ATw œ 7s). We show the con- 
straints on Ty in Figs. [BẸ and|8F for the cases of Mock NLS I 
and Mock NLS II, respectively (also see Table[I). It is clear 
that, in both cases, the derived Ty = 761395 and Tw = 83s are 
in excellent agreement with the input Tw = 80s of the model 
PSD. 


4.5. Fixing one parameter 


In this subsection, we fix Tẹ to 80s and search for the best 
combination of T, and a values. We create a 20 x 10 grid 
of log(fc) and a, with log(f.) varying from —6.0 to —4.0 
(Alog(f-) ~ 0.105) and o varying from 2.1 to 2.6 (Aa ~ 
0.044), respectively. We show the simultaneous constraints 
on both T, and a using contour plots in Fig. [[0]for the cases 
of Mock NLS I and Mock NLS II. As shown in Fig. for 
both cases, we cannot constrain well T, and a simultaneously 
Oe, the constraints are not sufficiently stringent). The shapes 
of the lo, 20, and 30 contours are all banana-like, which is 
caused by the degeneracy among various T,-a combinations, 
i.e., a PSD with a larger T, value and a smaller o value and 
a PSD with a smaller T, value and a larger o value can have 
very similar and even indistinguishable red-noise leakage ef- 
fects. However, when we use the method of the NLS slope 
to constrain o as in Section|4.4. land Fig. [9] we still obtain a 
well-constrained value of a = 2.361?:12 for the case of Mock 
NLS I. Therefore, we are able to uncover the o value of the 
underlying PSD even though T, is not fixed or close to the 
input model value. We do not intend to constrain the three 
parameters (i.e., a, Te, and Tw) simultaneously. 


5. DISCUSSION AND PERSPECTIVE 


In this paper, our simulated lightcurves have arbitrary units, 
which are generated following the assumed corresponding un- 
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FIG. 11.— Comparison between the binned Mock NLS II and a model NLS 
with log(f-) =—6 and a = 2.15. Although the PSD parameters of this model 
NLS deviate significantly from those of the input model PSD (see the red and 
black crosses in the right panel of Fig.[10}, it appears that this model NLS is 
statistically consistent with Mock NLS II, i.e., our approach cannot rule out 
this particular model NLS. 


derlying PSDs (see Po. DÄ. We then use an absolute normal- 
ization Aas = 2AT/n for the calculation of a periodogram 
such that the integration of the periodogram equals rms? 
(see Section 2.1). Real AGN lightcurves have some inter- 
esting properties, e.g., the log-normal distribution of fluxes 
and the rms-flux relation in both the X-ray 
and optical bands, which calls 
for the use of a physically well-motivated rms-normalization 
Ams = 2AT ein in calculating the NLS based on real ob- 
servational data. In the case of using Ans, the NLS will fi- 
nally flatten at some value larger than 1 instead of flatten- 
ing at 1, which, however, does not indicate red-noise leak- 
age affecting the highest-frequency PSD, and is actually due 
to that the Poisson noise level and thus the Tw value vary 
for lightcurve segments with different mean fluxes. An ad- 
ditional normalization Apeahy = 2AT /xn 
can be considered as well because in this case the Poisson 
noise level is constant and the NLS falls to 1 at high fre- 
quencies. Recently, devel- 
oped a new algorithm of time series simulation, which com- 
bines the properties of PSD and probability distribution func- 
tion (PDF) in generating a single artificial lightcurve. Their 
lightcurves can produce the rms-flux relation if a log-normal 
PDF is adopted. They also re-randomized the fluxes to mimic 
Poisson fluctuations during observations. All these features 
SEH Ee Se 
ter way to generate artificial lightcurves. However, this al- 
gorithm is obviously much more computationally intensive. 
Therefore, we do not adopt it because we are only inter- 
ested in uncovering the underlying high-frequency PSD us- 
ing the basic NLS properties, and our simpler method of sim- 
ulating lightcurves suffices for our purpose. Another way 
to reconcile the seeming contradictories at choosing differ- 
ent normalizations is that one could take the logarithm of 
fluxes (Uttley et al]|2005; 2009) instead of using 
count rates or fluxes (Uttley & McHardy||2005) that are usu- 
ally adopted. The logarithms of AGN fluxes typically dis- 
tribute in a Gaussian form, thus one does not have to resort 


to the |Emmanoulopoulos et all (2013) method. In the ap- 


pendix, we carry out an additional set of simulations to inves- 
tigate the effects of log-normal flux distribution and Poisson- 
statistics sampled lightcurves Oe. features of real AGN data) 
on the performance of NLS, following the method outlined in 
(2005). We find that, with an appropriate choice 
of normalization, the NLS can also be used to effectively con- 
strain the high-frequency PSD of lightcurves with realistic 
features, which further demonstrates that the NLS is an ef- 
fective and consistent tool to study AGN PSD. 

We use a broken power law with a fixed value of 6 = 1.0 
as our base PSD model; nevertheless, our NLS approach can 
be applied to any arbitrary PSD model, e.g., the one with 
a smoothly bending power law or multiple Lorentzian com- 
ponents [2007). Furthermore, our ap- 
proach is applicable to both X-ray and optical timing observa- 
tions of AGNs. If 8 is set free, because it can affect the sever- 
ity of red-noise leakage as much as IJ. our approach cannot 
constrain D and T, at the same time. In other words, the NLS 
approach by itself cannot constrain more than one parame- 
ters of the low-frequency PSD simultaneously. An approach 
of constraining a broadband PSD is described in (Uitley et all 
(2002), who utilized lightcurves observed with different sam- 
pling patterns across different timescales and from different 
telescopes. However, both the (2002) approach 
and its recently revised version (Marshalll2015) do not make 
enough efforts to alleviate the issue of red-noise leakage at the 
high-frequency PSD, which is the focus of our work. 

We use an example in Fig. [I] to show that simply fitting 
Mock NLS II with the model NLS cannot well constrain two 
parameters (i.e., œ and Jost (1 of the input model PSD simul- 
taneously. Although the model of a = 2.15 and log(f.)= —6 
deviates significantly from the input model of a = 2.30 and 
log(fc)= —4.70 as shown in Fig. and one can even some- 
how visually distinguish the two NLS resulted from these two 
models in bg HU the model of o = 2.15 and log(f.)=—6 can- 
not be ruled out statistically, due to that larger o and T, val- 
ues both lead to more severe red-noise leakage Oe, there is 
some degeneracy among various combinations of o and T,). 
Because the NLS values at different frequencies are not inde- 
pendent, the error bars on the model NLS shown in Fig. 
are overestimated, being inappropriately large. However, if 
we can fix the value of either a or log( fe), the situation will 
be reduced to the case of constraining one parameter at a time 
where stringent constraints can be obtained (see Section[4.4). 
For example, we can first set T, free and nail down o using 
the approach of the NLS slope as presented in Section 
or, we can first constrain T, (or more generally speaking, the 
low-frequency PSD) using longterm monitoring observations. 

The DRW (or the first-order continuous-time autoregres- 
sive process, or the Ornstein-Uhlenbeck process) model 
is widely used to model quasar optical vari- 
abilities. Techniques based on the DRW model have been 
developed for reverberation mapping (Zu et al|[201 1), AGN 
selection (e.g., and many other appli- 
cations. The PSD of the DRW model is a Lorentzian centered 
at zero, being a power law with a = 2 at high frequencies. 
However, some high-quality data have shown PSD results de- 
viating from the expectation of a simple DRW model (e.g., 
2013. In Fig, Bh we 
show that the NLS slope provides a diagnosis on whether the 
high-frequency PSD is in the form of P(f) « LIT. So we can 
examine whether the simple DRW model can describe the ob- 
servations adequately by simply checking whether the NLS 
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slope is flat. 

The issue of red-noise leakage cannot be solved by improv- 
ing S/N of observations that can be easily achievable using 
some future large facilities. Higher S/N observations poten- 
tially allow for revealing the even higher-frequency PSD that 
dictates intrinsic AGN variabilities at shorter timescales, but 
as red-noise leakage becomes more severe at higher frequen- 
cies when a > 2, the PSD results will still be biased if not 
more severely. In fact, red-noise leakage is more of an issue 
of sampling, which is typically limited by the length of a night 
or the period of an orbit. Due to this reason, even the proposed 
next-generation X-ray timing observatories such as the Large 
Observatory for X-ray Timing (LOFT; [De Rosa et al] (2015)) 
and ground-based optical survey telescopes such as the Large 
Synoptic Survey Telescope (LSST), both characterized by un- 
precedented S/N, still have to confront the issue of red-noise 
leakage in the high-frequency PSD. In particular, the duration 
of continuous exposure of the sensitive Large Area Detector 
onboard LOFT is limited to ~ 3 ks, because there is usually 
a ~ 2 ks gap due to the occultation of the Earth in a ~ 5 ks 
orbital period. Fortunately, our approach can mimic the real 
sampling patterns of observations and uncover the underlying 
high-frequency AGN PSD without demanding alternation on 
observational data, thereby offering great help in casting light 
on the physical origins of AGN variabilities. 


6. SUMMARY & CONCLUSIONS 


In this paper, we focus on constraining the underlying high- 
frequency AGN PSD that is characterized by the three param- 
eters of a, T., and Ty (see Fig.[}, and is typically strongly dis- 
torted due to red-noise leakage (see the right panel of Fig. B). 
We develop a novel and observable NLS (see Section 
and Fig. B) that can describe sensitively the effects of leaked 
red-noise power on the PSD at different temporal frequen- 
cies. We utilize Monte Carlo simulations to show detailedly 
how an AGN underlying PSD determines the NLS when there 
is severe red-noise leakage (see Section and Fig. Di and 
thereby how the NLS can be used to uncover the underlying 
PSD (see Section äh. In particular, we find that, when con- 
straining only one of the above three parameters at a time, 
we can obtain stringent constraints on that parameter of inter- 
est. When constraining a and T, simultaneously, we obtain 
loose constraints on both parameters due to the degeneracy 
among various combinations of o and T,. However, using the 
method of the NLS slope still enables us to obtain stringent 
constraints on o, irregardless of various T, values. Further- 
more, in real applications of our NLS approach, we can often 
reduce a complicated case of constraining two parameters si- 
multaneously to a simple case of constraining one parameter 
at a time, by setting some parameter to some fiducial value 
through delving into all information provided by all observa- 
tions available. As such, our NLS approach can mimic the real 
sampling patterns of observations and uncover the underlying 
high-frequency AGN PSD, without demanding interpolation, 
end matching, windowing, or any other alternation on obser- 
vational data. Additionally, our approach is applicable to both 
X-ray and optical timing observations of AGNs. Therefore, 
our NLS approach appears to be a useful and effective tool to 
help probe the phenomenon and eventually the physical ori- 
gins of AGN variabilities in the course of the incoming era of 
time domain astronomy. 
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APPENDIX 


EFFECTS OF LOG-NORMAL FLUX DISTRIBUTION AND 
POISSON-STATISTICS SAMPLED LIGHTCURVES ON NLS 


120 ~ 
100 —— 1/200 duration 


Flux (arbitrary unit) 
EI 
E 


0.0 0.5 1.0 1.5 2.0 25 3.0 3.5 4.0 
Time (106 s) 


Power (Hz-!) 


With Poisson Noise 


No Poisson Noise 


10-6 10-4 10-2 100 10-6 10-4 10-2 100 
f Hz) f (Hz) 

FIG. A1.— Top: Lightcurve simulated following the [Uttley et al] 
method, which possesses the properties of log-normal flux distribution and 
rms-flux relation that are found in real AGN lightcurves. The blue lightcurve 
in the inset is a segment of 1/200 length of the black lightcurve. Bottom 
left: Periodograms of the above two lightcurves without added Poisson noise. 
Bottom right: Periodograms of the above two lightcurves with added Poisson 
noise, which behaves as white noise at the highest frequencies. All peri- 
odograms are normalized with A;ms. Red-noise leakage is apparent in both 
bottom panels. 


When utilizing our NLS approach, as noted in Section [5] 
there are additional choices of periodogram normalization 
(i.e., Arms and Ajeany, Other than Aabs) and some features of 
real AGN data to be further explored. In this appendix, we 
therefore carry out an additional set of simulations to inves- 
tigate the effects of log-normal flux distribution and Poisson- 
statistics sampled lightcurves (i.e., features of real AGN data) 
on the performance of NLS. 

Following the method of and using a 
slightly different PSD model from the one adopted in Sec- 
tion [2.3] and Fig. [2] we first generate an artificial lightcurve 
with the e oe fs algorithm that is briefly 
outlined in Section [2.2] We then take the exponential of the 
flux to produce the log-normal distribution and exclude the 
high-frequency white-noise component of the PSD model in 
the simulation. Specifically, the PSD model used here is in 
the form of 


Af™, e f>fe 
AGE ie) > F< fo 
where a = 2.4, 8 = 1.0, and fs = 10° Hz (i.e., compared to 


Eq.B] the only difference is the missing high-frequency white- 
noise term). Additionally, we sample the resultant lightcurve 


PU) = (Al) 


mmm No Poisson noise, Arms 
— Timmer & Koenig 95, Aabs 
— With Poisson noise, Arms 


mmm No Poisson noise, Arms 
— Timmer & Koenig 95, Aabs 


— With Poisson noise, ALeahy 


=m No Poisson noise, Arms 


— Timmer & Koenig 95, Aabs 


— With Poisson noise, Aabs 


3.5 3.0 2.5 2.0 1.5 1.0 0.5 
log[f (Hz)] 


FIG. A2.— Comparison of NLS of different lightcurves with different nor- 
malizations adopted. The NLS of the lightcurve with log-normal flux distri- 
bution but without either Poisson or white noise is normalized with Arms and 
plotted in black in all panels. The NLS of the lightcurve with log-normal flux 
distribution and Poisson noise is normalized in three different ways and plot- 
ted in red in different panels. For comparison, the NLS produced from our 
previous simulation using the simpler[Timmer & Koenig algorithm is 
plotted in blue in all panels. 


using Poisson statistics to produce the errors in real observa- 
tions. The lightcurve is 4 x 10° s long and AT = 1 s. The 
final lightcurve is plotted in the top panel of Fig. where 
the inset displays a 2 x 10+ s segment. Taking the exponen- 
tial of a lightcurve with typical rms only changes slightly 
the shape of its broad continuum power spectrum (see Ap- 


pendix B of for a demonstration). De- 
spite of being generated with a nearly identical power spec- 
tral shape (see Eq. and Eq. B] for the slight difference 
between them), the lightcurve here is quite different from 
the lightcurve shown in Fig. [2| due to the usage of differ- 
ent simulation algorithms: the former displays multiple flares 
that are largely above a non-flaring (“quiescent”) flux level, 
being very similar to real AGN lightcurves; while the lat- 
ter displays both positive and negative fluctuations of simi- 
lar amplitudes around the mean flux level. The periodograms 
of the whole lightcurve (4 x 10° s long) and the segment 
lightcurve (2 x 10* s long), without and with added Poisson 
noise, are shown in the bottom-left and bottom-right panels 
of Fig.[A1] respectively. These periodograms are normalized 
with Auen = 2AT (Ein. where x represents the corresponding 
mean value of each of the above two lightcurves. It is clear 
that, at the highest frequencies above ~ 10-2 Hz, Poisson 
noise affects the periodograms in a similar way as white noise 
does (cf. the right panel of Po D 

We then cut the 4 x 10° s lightcurve into 200 segments of 
equal length, calculate the NLS with different choices of nor- 
malization, and plot these NLS in Fig. In each panel 
of Fig. we plot the same NLS of the lightcurve without 
added Poisson noise in black (i.e., no Poisson or white noise) 
and use only Ams; for comparison, we also plot the same NLS 
in blue that is generated from our previous simulation with the 
exactly same configuration except for adding white noise and 
is normalized by Aabs = 2AT /n (see the NLS in panel (c) of 
Fig. B). Without added (Poisson) noise, the lightcurve does 
not contain any frequency-independent variation, so its NLS 
(i.e., the black one) goes straight up without falling down, due 
to that the bias curve goes straight up if there is no noise (see 
Figs. Bh andBb); nevertheless, this NLS slope is still consis- 
tent with the slope of the blue NLS that is from the simpler 
simulation with the algorithm. In 
Fig. we also use three different normalizations to calcu- 
late the NLS of the lightcurve with added Poisson noise and 
plot them in red. In the top panel, we use A,m; for the red 
NLS, which has a slope consistent with that of the other two 
NLS and falls to a level above 1.0 (rather than being 1.0 for 
the blue NLS) after Poisson noise starts to dominate the power 
spectrum at high frequencies. In the middle panel, as we use 
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Ateahy = 2AT /xn to calculate the periodogram, the red NLS 
falls and flattens to 1.0 eventually, but its slope is slightly flat- 
ter than the other two NLS (note that a straightforward com- 
parison between the slopes of the red and black NLS can be 
made as they are from the same set of simulations). In the bot- 
tom panel, we use Aap; for the red NLS, which lies above the 
blue NLS at all frequencies and above the black NLS before 
falling down, and has a flatter slope than the other two NLS 
(compared to the middle panel, the degree of the red NLS be- 
ing flatter is more severe). 

It is not surprising to see the above dependence of NLS 
behavior on the different choices of normalization. As we 
know, the absolute power of Poisson noise is proportional to 


x (Vaughan et al/|2003) and the intrinsic variation amplitude 
is proportional to x* due to the rms-flux relation (Uttley et al! 
(3005) 


, where x is the local mean of each segment lightcurve. 
Using Arms and Ajeahy cancels out the factors of x associated 
with the intrinsic variation amplitude and the absolute power 
of Poisson noise, respectively; as the factors of + cannot be 
canceled out simultaneously, the red NLS is lifted up either at 
high frequencies (see the top panel) or low frequencies (see 
the middle panel). While using Aabs cannot cancel out any 
dependence on the mean flux level, the red NLS is lifted up 
at all frequencies compared to the blue NLS (see the bottom 
panel). 

Based on the above analysis and the effectiveness of 
the method of using the NLS slope to constrain the high- 
frequency PSD slope as demonstrated in Sections /4.4. 1} and 
we make the following recommendations regarding the 
usage of our NLS approach: (1) it is fine to adopt Aabs when 
dealing with lightcurves without realistic features (i.e., the 
blue NLS); (2) it is fine to adopt Arms or even ALeahy When 
dealing with lightcurves with realistic features Oe, the red 
NLS in the top and middle panels); and (3) it is risky to 
adopt Aaps when dealing with lightcurves with realistic fea- 
tures (i.e., the red NLS in the bottom panel). Finally, we con- 
clude that, with an appropriate choice of normalization, the 
NLS can be used to effectively constrain the high-frequency 
PSD of lightcurves with or without realistic features, which 
demonstrates that the NLS is an effective and consistent tool 
to study AGN PSD. 
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